This Raw Listing contains the General Information Section and up to the first 5 pages.



INPUT SET: S34756.raw



SEQUENCE LISTING

(1) General Information:

(i) APPLICANT: Choulika, Andre
Perrin, Arnaud
Dujon, Bernard
Nicolas, Jean-Francois

(ii) TITLE OF INVENTION: Nucleotide Sequence Encoding the Enzyme
I-SCEI and the Uses Thereof

(iii) NUMBER OF SEQUENCES: 52



(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Finnegan, Henderson, Farabow, Garrett & Dunner
(B) STREET: 1300 I Street, N.W.
(C) CITY: Washington
(D) STATE: D.C.
(E) COUNTRY: USA
(F) ZIP: 20005-3315



(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25



(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: Unassigned
(B) FILING DATE: 25-JAN-2000
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 09/244,130
(B) FILING DATE: 04-FEB-1999

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 09/119,024
(B) FILING DATE: 20-JUL-1998

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/336,241
(B) FILING DATE: 07-NOV-1994

DATE: 02/16/2000

RAW SEQUENCE LISTING TIME: 02:14:57

PAGE: 2 PATENT APPLICATION US/09/492,697

INPUT SET: S34756.raw



(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 0[illegible in source]
(B) FILING DATE: 05-NOV-1992

(B) FILING DATE: 05-MAY-1992

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Meyers, Kenneth J.
(B) REGISTRATION NUMBER: 25,146
(C) REFERENCE/DOCKET NUMBER: 3495-0111-11

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 202-408-4000
(B) TELEFAX: 202-408-4400

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 714 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
ATGCATATGA AAAACATCAA AAAAAACCAG GTAATGAACC TCGGTCCGAA CTCTAAACTG
CTGAAAGAAT ACAAATCCCA GCTGATCGAA CTGAACATCG AACAGTTCGA AGCAGGTATC
[sequence line illegible in source]
CAGTTCGAGT GGAAAAACAA AGCATACATG GACCACGTAT GTCTGCTGTA CGATCAGTGG
GTACTGTCCC CGCCGCACAA AAAAGAACGT GTTAACCACC TGGGTAACCT GGTAATCACC
TGGGGCGCCC AGACTTTCAA ACACCAAGCT TTCAACAAAC TGGCTAACCT GTTCATCGTT
AACAACAAAA AAACCATCCC GAACAACCTG GTTGAAAACT ACCTGACCCC GATGTCTCTG
CCATACTGGT TCATGGATGA TGGTGGTAAA TGGGATTACA ACAAAAACTC TACCAACAAA
TCGATCGTAC TGAACACCCA GTCTTTCACT TTCGAAGAAG TAGAATACCT GGTTAAGGGT
CTGCGTAACA AATTCCAACT GAACTGTTAC GTAAAAATCA ACAAAAACAA ACCGATCATC
TACATCGATT CTATGTCTTA CCTGATCTTC TACAACCTGA TCAAACCGTA CCTGATCCCG



[sequence line illegible in source]

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 237 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met His Met [remainder of row illegible in source]
[rows illegible in source]
Leu Val Ile Thr Trp Gly Ala Gln Thr Phe Lys His Gln Ala Phe Asn
Lys Leu Ala Asn Leu Phe Ile Val Asn Asn Lys Lys Thr Ile Pro Asn
[rows illegible in source]
Ile Asn Lys Asn Lys Pro Ile Ile Tyr Ile Asp Ser Met Ser Tyr Leu
Ile Phe Tyr Asn Leu Ile Lys Pro Tyr Leu Ile Pro Gln Met Met Tyr
Lys Leu Pro Asn Thr Ile Ser Ser Glu Thr Phe Leu Lys

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 722 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
[illegible]TAAAA TCATATGAAA AATATTAAAA AAAATCAAGT AATCAATCTC GGTCCTATTT
CTAAATTATT AAAAGAATAT AAATCACAAT TAATTGAATT AAATATTGAA CAATTTGAAG
CAGGTATTGG TTTAATTTTA GGAGATGCTT ATATTCGTAG TCGTGATGAA GGTAAAACTT
ATTGTATGCA ATTTGAGTGG AAAAATAAGG CATACATGGA TCATGTATGT TTATTATATG
ATCAATGGGT ATTATCACCT CCTCATAAAA AAGAAAGAGT TAATCATTTA GGTAATTTAG
TAATTACCTG GGGAGCTCAA ACTTTTAAAC ATCAAGCTTT TAATAAATTA GCTAACTTAT
TTATTGTAAA TAATAAAAAA CTTATTCCTA ATAATTTAGT TGAAAATTAT TTAACACCTA
TGAGTCTGGC ATATTGGTTT ATGGATGATG GAGGTAAATG GGATTATAAT AAAAATTCTC
TTAATAAAAG TATTGTATTA AATACACAAA GTTTTACTTT TGAAGAAGTA GAATATTTAC
TTAAAGGTTT AAGAAATAAA TTTCAATTAA ATTGTTATGT TAAAATTAAT AAAAATAAAC
CAATTATTTA TATTGATTCT ATGAGTTATC TGATTTTTTA TAATTTAATT AAACCTTATT
TAATTCCTCA AATGATGTAT AAACTGCCTA ATACTATTTC ATCCGAAACT TTTTTAAAAT
AA

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 235 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Lys Asn Ile Lys Lys Asn Gln Val Met Asn Leu Gly Pro Asn Ser
Lys Leu Leu Lys Glu Tyr Lys Ser Gln Leu Ile Glu Leu Asn Ile Glu
Gln Phe Glu Ala Gly Ile Gly Leu Ile Leu Gly Asp Ala Tyr Ile Arg
Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gln Phe Glu Trp Lys Asn
Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gln Trp Val Leu
Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn Leu Val
Ile Thr Trp Gly Ala Gln Thr Phe Lys His Gln Ala Phe Asn Lys Leu
Ala Asn Leu Phe Ile Val Asn Asn Lys Lys Leu Ile Pro Asn Asn Leu
Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe Met Asp
Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Leu Asn Lys Ser Ile
Val Leu Asn Thr Gln Ser Phe Thr Phe Glu Glu Val Cys Tyr Leu Val
Lys Gly Leu Arg Asn Lys Phe Gln Leu Asn Cys Tyr Val Lys Ile Asn
Lys Asn Lys Pro Ile Ile Tyr Ile Asp Ser Met Ser Tyr Leu Ile Phe



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 02/16/2000

PATENT APPLICATION US/09/492,697 TIME: 02:14:58

INPUT SET: S34756.raw

Line    Error                              Original Text
31      Wrong application Serial Number    (A) APPLICATION NUMBER: Unassigned



